FISHER INFORMATION AND CONVERGENCE 
TO STABLE LAWS 



S. G. BOBKOVi'4, G. P. CHISTYAKOV^, AND F. GOTZE^ 

Abstract. The convergence to stable laws is studied in relative Fisher information 
for sums of i.i.d. random variables. 



1. Introduction 

Let {Xn)n>i be independent identically distributed random variables, and define the 
normalized sums 

_ Xi + --- + X„ 

On 

for given (non-random) normalizing sequences a„ G R and 6„ > 0. Assuming that Zn 
converges weakly in distribution to a random variable Z with a non-degenerate stable 
law, we consider the Fisher information distance 

where p„ respectively ip denote the densities of Z„ respectively Z. The definition makes 
sense, if pn is absolutely continuous and has a Radon-Nikodym derivative p'^. Otherwise, 
one puts I{Zn\\Z) = +00. 

If Xi has finite second moment with mean zero and variance one, the classical central 
limit theorem is valid, that is, Zn ^ Z (weakly in distribution), with a„ = 0, 6„ = ^/n, 
where Z is standard normal. In this case a striking result of Barron and Johnson [B-J] 
indicates that I{Zn\\Z) — )• 0, as n — )■ 00, as soon as I{Zn\\Z) < +00, for some n, that 
is, if for some n, Z„ has finite Fisher information 

/(z„) = dx. 

J -00 Pn{x) 

This observation considerably strengthens a number of results on the central limit 
theorem for strong distances involving the total variation and the relative entropy. It 
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raises at the same time the question about possible extensions to non-normal limit stable 
laws (as mentioned e.g. in [J], p. 104). The question turns out to be rather tricky, and it 
is not that evident that I{Zn) needs to be even bounded for large n (a property which 
is guaranteed by Stam's inequality in case of a finite second moment). 

The present note gives an affirmative solution of the problem in case of the so-called 
non-extremal stable laws (cf. Definition 1.2 below). In the sequel, we shall consider 
non-degenerate distributions, only. 

Theorem 1.1. Assume that the sequence of normalized sums Z„ defined above 
converges weakly to a random variable Z with a non-extremal stable limit law. Then 
I{Zn\\Z) — >■ 0, as n ^ oo, if and only if I{Zn\\Z) < +00, for some n. 

The normal case is included in this assertion. Note, however, that if Xi has an 
infinite second moment, but still belongs to the domain of normal attraction, we have 
I{Zn\\Z) = -\-oo for all n. Hence, in this special case there is no convergence in relative 
Fisher information. 

In the remaining cases Z has a stable distribution with some parameters < a < 2, 

— 1 < /3 < 1, with characteristic function f{t) = Ee**^ described by 

f{t) ^ex.p{iat-c\t\"{l + i/3sign{t)uj{t,a))}, (1.1) 

where a e R, c > 0, and u(t,a) = tan(^) in case a ^ 1, and uj{t,a) = | log |t| for 
a = 1. In particular, |/(t)| = e^'^'*'" which implies that Z has a smooth density ^{x). 
A stable distribution is called non-extremal, if it is normal or, if < a < 2 and 

— 1 < ^ < 1. In the latter case, the density ^ of Z is positive on the whole real line and 
satisfies asymptotic relations 

ij{x) ~ Co |a;r(^+") {x -00), '0(^) ~ ci x'^^'^"''^ (x +00) (1.2) 

with some constants Cq, Ci > 0. 

The property that Xi belongs to the domain of attraction of a stable law of index 
< q; < 2 may be expressed explicitly in terms of the distribution function Fi{x) — 
P{Xi < x}. Namely, we have Z with some bn > and a„ e R, if and only if 

Fi{x) = (co + o(l))|x|-"S(|x|) (x^-00), (1.3) 
1-Fi{x) = {ci + o{l))x-'^ B{x) (x^+oo), (1.4) 

for some constants Cq, ci > that are not both zero, and where B(x) is a slowly varying 
function in the sense of Karamata. This description refiects a certain behaviour of the 
characteristic function fi{t) = Ee'^^^ near the origin (cf. [I-L], [Z]). 

In connection with Theorem 1.1, note that a similar assertion has recently been 
proved in [B-C-Gl] for the relative entropy 

D{Z„\\Z)^ r^p„{x)log^dx. 
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Namely, it is shown that D{Zn\\Z) — )■ 0, if and only if ^ Z and D{Zn\\Z) < +00 
for some n (thus extending Barron's entropic central limit theorem, [B]). In the normal 
case, it is known that, if 'EXi — 'EZ and Var(Xi) = Var(Z) = cr^, then 

2 

^I{Z,,\\Z)>D{Z,,\\Z). (1.5) 

Hence, the convergence in Fisher information distance is a stronger property than in 
relative entropy. The question how these two distances are related to each other with 
respect to other stable laws does not seem to have been addressed in the hterature. 
Apparently it is a question about the existence of certain weak logarithmic Sobolev 
inequalities for probability distributions with heavy tails, and we do not touch it here. 
However, it is natural to conjecture that the situation is similar as in the normal case 
via a suitable analogue of (1.5). 

Another obvious question concerns the description of distributions satisfying the 
conditions of Theorem 1.1. In the non-normal case, the property J(Z„||Z) < +00 
may be simplified to I{Zn) < +00. Taking, for example, n = 1, we obtain I{Xi) < +00 
as a sufficient condition. This is however a rather strong condition, which may be 
considerably weakened by choosing larger values of n. One may wonder therefore what 
assumptions need to be added to (1.3)-(1.4) in terms of Fi or fi to obtain the convergence 
of Zn to Z in relative Fisher information. A direct characterization may be given in 
terms of the behaviour of /i at infinity, at least in one particular case. As shown in 
[B-C-Gl], if Xi has a finite first absolute moment, the property /(^„) < +00 for some 
n is equivalent to any of the following two conditions: 

a) For some £ > 0, \fi{t) \ = 0{t~^), as t ^ +00; 

b) For some u > 0, 

/+00 
1/1(^)1^^2^^ < +00. (1.6) 
-00 

This characterization may be used in Theorem 1.1 in case 1 < a < 2, since then, by 
(1.3)-(1.4), we have E |Xi|^ < +00, for all < 5 < a. 

When Xi has finite second moment (which corresponds to the case a — 2), (1.6) may 
be replaced by the weaker condition 

/+00 
Ifiit)]" \t\dt < +00, for some i/>0. (1.7) 
-00 

Removing the weight \t\ from the integral (1.7), we obtain another weaker condition. 
It will be equivalent to the property that Zn has an absolutely continuous distribution 
with a bounded continuous density Pn, for some and consequently for any sufficiently 
large n. In that and only that case, the following uniform local limit theorem holds: 
supj. \pnix) — 'ip{x)\ — )■ 0, as n — )■ 00 (provided that Zn ^ Z with < a < 2, cf. [I-L]). 

The paper is organized as follows. First we state some general bounds on Fisher 
information and some properties of densities which can be represented as convolutions 
of densities with finite Fisher information (Sections 2-4). A main result used here has 
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been already proved in recent work [B-C-G3] by the authors. In Section 5 we turn to the 
stable case and discuss a number of auxiliary results such as local limit theorems, as well 
as questions about the behaviour of characteristic functions of Z„ near zero. In Section 

6 we reduce Theorem 1.1 to showing that the Fisher information is bounded in 
n. The subsequent sections are therefore focused on this boundedness problem. Section 

7 introduces a special decomposition of convolutions, and the final steps of the proof of 
Theorem 1.1 can be found in Section 8. Wc shall complement the proofs by comments 
explaining why the condition (1.6) is sufficient for the validity of Theorem 1.1. 



2. General Results about Fisher Information 



If a random variable X has an absolutely continuous density p with Radon-Nikodym 
derivative p', its Fisher information is defined by 

where the integration may be restricted to the set {x : p{x) > 0}. In any other case, 
/(X) = +00. 

If I{X) is finite, then necessarily the distribution of X has to be absolutely continuous 
with density p{x) such that the derivative p'{x) exists and is finite on a set of full 
Lebcsguc measure. Furthermore, one can show that, if I{X) < +oo, then p'{x) = at 
any point, where p{x) = 0. 

It follows immediately from the definition that the /-functional is translation invariant 
and homogeneous of order —2, that is, I {a + bX) = ^ ^i^), for all a G R and 6 7^ 0. 

Since the function u'^/v is convex in the upper half-plane u G R, v > 0, this functional 
is convex. That is, for all densities Pi, . . . ,p„, we have Jensen's inequality 

n n 

l{0LxP\ H h ) < X] ^kHPk) (ttfc > 0, ^ afc = 1) . 

k=l k=l 

The inequality may be generalized to arbitrary "continuous" mixtures of densities. In 
particular, for the convolution 

/+00 
p{x - y)<i{y) dx 
-00 

of any two densities p and q , we have 

I{p *q) < min{J(j9), I{q)}. (2.2) 

In other words, if X and Y are independent random variables with these densities, then 

/(X + Y)< min{/(X), !{¥)}. 

This property may be viewed as monotonicity of the Fisher information: This functional 
decreases when adding an independent summand. In fact, a much stronger inequality 
is available. 
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Proposition 2.1. (Stam [St]) If X andY are independent random variables, then 

1 >^ + ^. (2.3) 



I{X + Y)- I{X) I{Y) ■ 
Let us also introduce the Fisher information distance 



with respect to a random variable Z having a stable law. We need the following el- 
ementary observation, which shows that the question of boundedness of the Fisher 
information I{Zn) and of the Fisher information distance I(Zn\\Z) for the normalized 
sums Zn as introduced in Theorem 1.1 are in fact equivalent. 

Proposition 2.2. If Z has a non-extremal stable law of some index < a < 2, then, 
for any random variable X, 

I{X\\Z) < 2/(X) + c(Z), (2.4) 
I{X) < 2I{X\\Z) + c{Z), (2.5) 

where c{Z) depends on the distribution of Z, only. In particular, I{X\\Z) < +oo, if and 
only if I{X) < +00. 

The assertion is based on the fact that any non-extremal non-normal stable distri- 
bution has a smooth positive density 'ip such that, for all A; = 1, 2, ... , 



|(logV'(x))W|~^^-^ (|x|^+oo) 

Ob 



(cf. [I-L], [Z]). In particular, it yields 



<^<^ (^eR) (2.6) 



1 + \x\ ip{x) l-\-\x 
with some positive constants Cj. Hence, assuming that I{X) < +00, then writing 

\p(x) 'ijj(x) J ~ \p(x)J \'ijj(x)J ~ \p{x)J ^ 
and integrating this inequality with weight p{x), we obtain (2.4). Similarly, 

\p(x)J ~ ^p(x) ijj(x) J ^' 

which leads to (2.5). 

Similar arguments for the normal case {a — 2) however lead to a different conclusion. 

Proposition 2.3. If Z is normal, then I{X\\Z) < +00, if and only if I{X) < +00 
and EX^ < +00. 
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Note that in case where X and Z have equal means and variances, we have /(X| |Z) = 
I{X)-I{Z). 

3. Connection with Functions of Bounded Veiriation 

Applying Cauchy's inequality, one immediately obtains from the definition (2.1) the 
following elementary lower bound on the Fisher information. 

Proposition 3.1. If X has an absolutely continuous density p with Radon- Nikodym 
derivative p' , then 

f + OO 



/+00 
\p\x)\dx. (3.1) 
■oo 



Here, the integral represents the total variation norm of the function p as used in 
Real Analysis, 

n 

IIpIItv = sup ^ \p{xk) -p{xk-i)l 

k=l 

where the supremum runs over all finite collections Xq < Xi < ■ ■ ■ < Xn- 

The densities p with finite total variation are vanishing at infinity and are uniformly 
bounded by ||p||tv- Moreover, their characteristic functions 

/ + 00 
e**^p(a;) dx {t G R) 
-oo 

admit, by integration by parts, a simple upper bound 

l/(t)l<^ (t^O). (3.2) 

Hence, by Proposition 3.1, if a random variable X has finite Fisher information, its 
density p and characteristic function f{t) = Ee**^ satisfy similar bounds 

maxp(x)<v^, |/(t)|<^^^ (t^O). (3.3) 

In general, the inequality (3.1) cannot be reversed, though this is possible for convo- 
lutions of three densities of bounded variation. The following statement may be found 
in [B-C-G3]. 

Proposition 3.2. // independent random variables Xj {j = 1, 2, 3) have densities 
Pj of bounded variation, then S — Xi + X2 + X^ has finite Fisher information, and 
moreover, 

1(3) < ^ |bl||Tv|b2||TV+ |bl||Tv|b3||TV+ |b2||Tv|b3||Tv]- (3.4) 



Fisher information and stable laws 



7 



Note that the convolution of two densities of bounded variation (e.g., corresponding 
to the uniform distribution on finite intervals) may have an infinite Fisher information. 

Remark 3.3. A similar bound on the Fisher information may also be given in terms 
of characteristic functions. In view of (3.4), it suffices to bound the total variation norm, 
which can be done by applying the inverse Fourier formula, at least in case of finite first 
absolute moment. Namely, one can easily show that, if a random variable X has a 
continuously differentiable characteristic function f{t) for t > 0, and 

/ + 00 
tmf{t)\'+\f{t)\')dt<+^, (3.5) 
-oo 

then X must have an absolutely continuous distribution with density p of bounded total 
variation satisfying 

(/•+00 /•+00 \ 1/4 

/ \tf{t)\'dt mm'dt) . (3.6) 
J —oo J —oo J 



4. Classes of Densities Representable as Convolutions 

General bounds like (3.3) may considerably be sharpened in the case where p is rep- 
resentable as convolution of several densities with finite Fisher information. Here, wc 
consider the collection ^2(-^) of all functions on the real line which can be represented 
as convolution of two probability densities with Fisher information at most /. Corre- 
spondingly, let = U7*P2(-^) denote the collection of all functions representable as 
convolution of two probability densities with finite Fisher information. Note that, by 
(2.3), /(p) < iJ, for anypeq32(/). 

Thus, a random variable X = X^^ X^ has density p in if the density may be 
written as 

/+00 
Pi{x - y)p2{y) dx (4.1) 
-00 

in terms of absolutely continuous densities Pi,P2 of the independent summands Xi,X2 
having finite Fisher information. Differentiating under the integral sign, we obtain a 
Radon-Nikodym derivative of the function p, 

/+OO Z'+OO 
P'i{x - y)p2{y) dy ^ / p[{y)p2{x-y)dy. (4.2) 
-00 J —00 

The latter expression shows that p' is an absolutely continuous function and has the 
Radon-Nikodym derivative 

/+00 
p'Mp'2{x-y)dy. (4.3) 
■00 
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In other words, p" appears as the convolution of the functions p'l and p'2 which are 
integrable, according to Proposition 3.1. 

Note that equahty (4.3) defines p"{x) at every individual point x, not just almost 
everywhere (which is typical for a Radon-Nikodym derivative). Using the property 
Pj{x) = ^ Pj{x) = in case of finite Fisher information, we obtain a similar implica- 
tion p{x) = =^ p"{x) = 0, which holds for any x. 

Moreover, a direct application of the inequality (3.1) in (4.3) shows that p' has finite 
total variation 



/+00 
\p"(x)\dx<I. 
-OD 



These formulas may be used to derive various pointwise and integral relations within 
the class ^2 such as the following. 

Proposition 4.1. Given a density p in ^2(1), for all x E'R, 



\p'{x)\ < /^'/^VPR < I- (4-4) 

In addition, 

^ ^ ^ dx < P. (4.5) 



To be more precise, integration in (4.5) is restricted to the set {p{x) > 0}. This 
proposition can be found in [B-C-G3]; since the proof is short, we shall include it here 
for completeness. 

Starting with (4.1), where I{pj) < I, define the functions Uj{x) = i{pj{x)>o} 
{j = 1,2). Applying Cauchy's inequality in (4.2), we get 



P'i^y = (/ Mi(x) ■ V Pi{x - y) P2{y) dy 
J —00 

/+00 
Piix-y)p2iyydy < I{X,)^/I{X2) p{x), 
■00 

where we used P2{y) < ■\/T{X^ on the last step. Hence, we obtain the first inequality 
in (4.4), and the second follows from p{x) < vT. Similarly, rewrite (4.3) as 

+00 



p"i^) = / {uii^ - y)u2{y)) \/pi{x- y)p2{y) dx, 
to get 

/+00 f+00 
ui{x - yf U2{yf dy j pi{x - y)p2{y) dx = u{xfp{x), 
-00 J —00 
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where we define m > by 



/ + 00 
- yfu2{yfdy. 
-oo 



It also follows that 

f + OO 

u{xfdx = I{Xi)I{X2) < /^ 



J — t 



which implies (4.5) and thus proves Proposition 4.1. 

The analytic properties of densities in allow to make use of different formulas for 
the Fisher information (by using integration by parts). For example, 

/ + 00 
p"{x) logp(a;) dx, 
-oo 

provided that the integrand is Lebesgue integrable. 

We will need the following "tail- type" estimate for the Fisher information. 

CoroUairy 4.2. Ifp is in ^2(1), then for any T real, 

^ dx < logp(T)| + / (^y^ p(x) log^p(x) dx) . (4.6) 

Proof. Let us decompose the open set G = {x > T : p(x) > 0} into the union of 
disjoint intervals (a„, 6.„). Note that one of them may be unbounded. Clearly, if Ort > T, 
we have p(a„) = 0, so p'{x)logp{x) — ?> 0, as a; J, a„, by Proposition 4.1. Similarly, 
p{bn) = 0, if bn < +00, and in addition p(-|-oo) = 0. 

Let an < Ti < T2 < bn- Since p' is an absolutely continuous function of bounded 
variation, integration by parts is justified and yields 

f^^p'ixf f^^ f^^ 

/ — -—- dx — / p'{x) dlogp{x) — p'{x)logp{x) — / p"{x) logp{x) dx. 

Jti P[X) Jti x=Ti JTi 

Letting Ti an and T2 — )■ 5„, we get in case an > T 

f>-p'{xf f'- 

/ — ■—— dx = — P [x) logp(j;) dx 

Jan P\X) Jan 



and 



bn fbn 

dx = -p'{T) logp(r) - / p"{x) \ogp{x) dx 

an P[^) Jan 



in case an — T (if such n exists). Anyhow, the summation over n gives 

^-^dx < \p' (T) log p(T)\+ f \p"(x) log p(x)\dx. (4.7) 
P{x) Jg 
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Here the first term on the right-hand side can be estimated by virtue of (4.4), which 
leads to the first term on the right-hand side of (4.6). Using (4.5) together with Cauchy's 
inequality, for the the last integral we also have 

/ —== \/p{x)\ log p{x)\dx\ < p{x)log^ p{x) dx, 

Jg \/p{x) J Jt 

thus proving Corollary 4.2. 

5. Stable Laws and Uniform Local Limit Theorems 

Let us return to the normalized sums 

Zn = ^{Xi + --- + Xr,)-an (a^GR, 6„>0), (5.1) 

associated with independent identically distributed random variables (X„)„>i. In this 
section we discuss uniform limit theorems for densities Pn of Z„ and behavior of their 
characteristic functions near the origin. As before, if Z„ =^ Z, the density of the stable 
limit Z is denoted by ■0. 

Introduce the characteristic functions of and 

h{t) = Ee^*^\ Ut) = Ee^*^" = e''"'- hit/b^T {t e R)- 
To avoid confusion, we make the convention that Zi — Xi, i.e., ai — and bi — 1. 

Proposition 5.1. Assume that Z^^ Z weakly in distribution. If 

/ + 00 
\ flit)]" dt < +00, for some u>0, (5.2) 
-OD 

then for all n large enough, Z„ have bounded continuous densities Pn such that 

lim sup \pni-c) — ip{x)\ = 0. (5.3) 



Proposition 5.2. Assume that Z^^ Z weakly in distribution. If 

/ + 00 
\fi{t)\'' \t\dt < +00, for some i/ > 0, (5.4) 
■oo 

then for all n large enough, Zn have continuously differentiable densities Pn with bounded 
derivatives, and moreover 

lim sup \p'^{x) — ip' {x)\ — 0. (5.5) 



The first assertion is well-known, cf. [I-L], p. 126. The condition (5.2) is actually 
equivalent to the property that for all sufficiently large n, say n > no, Zn have bounded 
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continuous densities Pn- In that case, the characteristic functions are integrable 
whenever n > 2no. Conversely, under (5.2), these densities ior n > u are given by the 
inversion formula 

1 r+oo 

Pn{x)^— e-''^fn{t)dt. (5.6) 



27r„ 

Under the stronger assumption (5.4), the above equality may be differentiated, and 
we get a similar representation for the derivative. 

1 r+oo 

K(^) = 2^y {-tt)e-^'^Ut)dt. (5.7) 

Although Proposition 5.2 is not stated in [I-L], its proof is identical to the proof of 
Proposition 5.1. An important ingredient in the argument is the fact that the weak 
convergence Zn ^ Z forces /i to be regularly behaving near the origin. This fact can 
also be used in the study of the boundedness of the Fisher information distance /(Z„||Z), 
so let us state it separately. 

Proposition 5.3. If Zn ^ Z weakly in distribution, where Z has a stable law of 
index a e (0, 2], then 

|/i(t)|=exp{-c|trMl/|t|)}, (5.8) 
where c > and h = h{x) is a slowly varying function for x +oo such that 

hm ^ = 1. (5.9) 

One of the consequences of Proposition 5.3 is that, given < (5 < a, the characteristic 
functions /„ admit on a relatively large interval the bound 

\fn{t)\ < e-^^'^\'\' {\t\<eh^) (5.10) 

with some positive constants e and c{5) which arc independent of n, cf. [1-L], p. 123. This 
bound shows that the parts of the integrals (5.6)-(5.7) taken over the region T < |t| < ehn 
with fixed T > are indeed small, while the assumptions (5.2) and (5.4) guarantee 
smallness of these integrals taken over the remaining region \t\ > 

In comparison with (5.8) a more precise statement is obtained in [I-L], cf. Theorem 
2.6.5, p. 85. Namely, if Z„ ^ Z, then for all t small enough, 

/i(i) = exp [i^t - c|i|"/i(l/|i|) (1 + i/3 sign(i) uj{t, a))], 

where 7 is real, c > 0, and the parameters a G (0,2], (3 G [—1,1], and the function 
Ci;(t, a) are the same as in the representation (1.1) for the logarithm of the characteristic 
function f{t) of Z. By lengthy computations in the proof of Theorem 2.6.5 in [I-L], it 
was shown that the function B{x) appearing in the asymptotic relations (1.3)-(1.4) and 
the function h{x) are connected via 

h{x) ^ {1 + o{l))B{x), as x ^ +00. 
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Taking into account (5.9), this implies that in the case < a < 2 there is a constant 
c > such that, as n — >■ oo. 



c 



P{|Xi| >6J~-. (5.11) 



n 



Let us return to local limit theorems. From (5.3) and (5.5) we immediately obtain 
the convergence of a "truncated" Fisher information distance. 

Corollary 5.4. Assume that ^ Z weakly in distribution, where Z has a non- 
extremal stable law. If I{Znn) < +oo, for some uq, then for all n large enough, the 
random variables Z^ admit continuously differentiable densities Pn, and for every fixed 
T>0, 

f - ^)%n(a;) dx = 0(1), n ^ oo. (5.12) 

J-T ^Pn{x) 1p{x) J 

The only property of non-extremal stable laws which is used to show (5.12) is the 
fact that ip is everywhere positive. 

By the assumption, we have I{Zn) < +oo, for all n > Uq, and by (3.3), 



with c = ^yi{Zno). Hence, the condition (5.4) is fulfilled with u — Suq. Therefore, we 
get both (5.3) and (5.5), and in particular, Pn{x) > £ > in |x| < T, for all n large 
enough. As a result, the integrand in (5.12) is uniformly bounded by a sequence tending 
to zero. 



6. Moderate Deviations 



As before, for independent identically distributed random variables {Xn)n>i put 
S 

Zn = - an, where Sn ^ Xi -{ \- (a„ e R, 6„ > 0). (6.1) 

It is well known that if Z„ =^ Z, where Z has a stable law of some index < o; < 2, 
then necessarily 

bn = n^^^'hin), (6.2) 

where /i is a slowly varying function in the sense of Karamata. 

To study the behaviour of /(Z„| |Z) in the non-extremal non-normal case, it is worth- 
while noting that this Fisher information distance is finite, if and only /(Z„) is finite 
(Proposition 2.2). In the normal case, I{Zn\\Z) < +oo, if and only if I{Zn) < +oo 
and EZ^ < +oo (Proposition 2.3). The last inequality is equivalent to EXf < +00 
and then for the weak convergence Zn ^ Z with a standard normal limit one may take 
bn — VarXi and a„ = 'EXia/u/ y/VaiXi . 
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In any case, the requirement that /(^no) < +00 imphes that for all n > no, Zn 
have absolutely continuous bounded densities which we denote in the sequel by p„. 
Moreover, G whenever n > 2no, and then p„ have continuous derivatives p'^ of 
bounded variation (as discussed in the previous section). 

As the next step towards Theorem 1.1, we prove: 

Lemma 6.1. Assume that Zn ^ Z weakly in distribution, where Z has a non- 
extremal stable law. If limsup„_^(^ < +00, then 

lim I{Zn\\Z) = 0. (6.3) 



Proof. As before, denote by tjj the density of Z. 

By the assumptions, we have for some Uq and with some constant /' 

I{Sn) < I'bl n > no. 

If n > 2no, write n = ni + n2 with ni = [|], ^2 = n — ni. Then ni > hq and 712 > no, 
and hence 

HSnJ < I'bl < liSn - SnJ < I'bl < 

with some constant / in view of the almost polynomial behaviour of 6„ as described in 
(6.2). Thus, 

a„ + 



bn 

represents the sum of two independent random variables with Fisher information at 
most I. Therefore, p„ G '^2{]^)-! ^or all n > 2no, and we may invoke Corollary 4.2. 

In view of Corollary 5.4 we only need to show that, given s > 0, one may choose a 
T > such that the integral 



\x\>T ^Pn{x) ij{x) 

is smaller than e, for all n large enough. 
Clearly, J < 2Ji + 2 J2, where 

Jx = / = / ^Yfi Pn{x) dx. 

J\x\>T Pn{x) J\x\>T^ij{x) J 

Recall that ^-^^^^ < yqq^ with a constant c depending on ■0, only (cf. (2.6)). Hence, 
which thus can be made as small, as we wish. 
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It remains to estimate Ji. We now apply (4.6) giving 

Jl < /^/^ (vP^I logPn(T)| + VPn(-T) I l0gp„(-r)|) 

+ 2l( [ pn{x) log'' pn{x)dx] . (6.4) 

\J\x\>T / 

Using the uniform local limit theorem in the form (5.3) together with the asymptotic 
relation (1.2) for ■0(x) at infinity, we easily get 

VPn{±T) I logp„(±T) I < c ^^ll^ + £„, (6.5) 

which holds for all sufficiently large n and all T >Tq with — > (as n — >■ oo) and with 
constants c > and Tq > 10 depending on ip, only. 

To bound the integral in (6.4), we partition {x : \x\ > T} into the set 

A — {x : \x\ > T, Pn{x) < 

and its complement B. By the definition, 

I Pn{x) log' Pn{x) dx < 16 f {x]'"^ log' \x\ dx < —. (6.6) 

On the other hand, Pn are uniformly bounded, namely, supp„(a;) < \/7, for all n > 2no 
(cf. (3.3)). Hence, on the set B, 

I logpJx)\ < log — -— + I log V7 I < 41og \x\ + I log /I, 

Pn{x) 



and therefore 

/ Pn{x) log' Pn{x) dx < C Pn{x) log' \x\ dx, (6.7) 

Jb J\x\>T 

where the constant depends on /. 

Finally, we use the property that the moments E iZnl*^ are uniformly bounded in n, 
whenever < 6 < a (cf. [I-L], p. 142). Choosing 6 = a/2 and using an elementary 
bound > Cq, log^ |a;| (where > Tq > 10), we obtain with some constant K that 

i^>E|Z„r/2 > T"/4E|Z„r/'^l|,^„|>r} 

= /" \x\'^/^ Pn{x) dx > CaT'^/^ [ Pn{x) log' \x\ dx . 

J\x\>T J\x\>T 

Thus, the second integral in (6.7) may be bounded by cT^"^^ with some constant c 
independent of n. Combining this with (6.6), we obtain a similar bound for the integral 
in (6.4), and taking into account (6.5), we get Ji < cT~°'/^ + £„. This completes the 
proof of Lemma 6.1. 
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7. Binomial decomposition of convolutions 

To show that the assumption in Lemma 6.1 holds as long as /(^no) < +oo, for some 
no, we introduce a special decomposition of densities Z„. To simplify the argument, 
assume no = 1, so that I{p) = I{Xi) < +oo, where p denotes the density of Xi. In fact, 
we only consider the shifted normalized sums 



and for the notational convenience, denote their densities by Pn- Note that, by the 
translation invariance, = I{Zn)- 

Keeping the same notations as in the previous sections, we use a suitable truncation 
(which is actually not needed in case a > 1). Introduce the probability densities 

^ 1~ZV^(^"^) 1{NI<1}' = ^p{bnX) l{\x\>l} 

-l On Ofi 

together with their characteristic functions 

fnit) = — ^ / e^'^/'M^) dx, g„{t) = - / e^'^/'M^) dx, 

J- 0,1 J -bn On J\x\>bn 

where 6n = J^^^^i, p{x) '^^ ~ f with some constant c > 0, as emphasized in (5.11). 
Then we have a binomial decomposition for convolutions 

p„ = ((1 - Sn)Pn + SnQnT* = E " '^")' ^n'" Pn * (7-1) 

k=0 ^ ^ 

Note that each convolution p^* * ''^* appearing in this weighted sum represents a 
probability density with characteristic function fni't)'^ gn{'t)^~'^ ■ 

In this section we establish some properties of /„. which will be needed in the proof 
of Theorem 1.1. The corresponding density Pn is supported on [—1, 1], however, it does 
not need to have mean zero. So, put 

'■fen 



xp{x) dx, 

bn 



and define 

which is the characteristic function of the centered random variable ^ — dn, when ^ has 
density pn- Thus, ipn corresponds to the density r„(a;) = p„(x + dn), with ip'n{0) — 0. 

In the sequel we assume that Z„ =^ Z weakly in distribution, where Z has a non- 
extremal stable law with index < a < 2. The next two lemmas do not use the 
assumption I{p) < +oo and may be stated for general distributions from the domain of 
attraction of these stable laws. 



1-5. 
i 
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Lemma 7.1. For all real t, with some constant C depending on p, only, 

mt)\<^\t\- (7.2) 

Proof. The characteristic function i/j^ corresponds to the density Pn{x + dn). Using 
the property '?/'^j(0) = 0, one may write 

m) = j\x-dn){e''^^-'-^-l)pn{x)dx 

j' {x - dn) (e^*(--<^") - l)p{hnx) dx 

where Fi is the distribution function of Xi. Using |e** — 1| < \s\ (s G R), we deduce 
obvious upper bounds 

Wn{t)\ < r^£" (^-'^")''^^^(^) 

< ^'^1 rx-dFAx) + ^d- 

Integrating by parts, we have 

X^ dFi{x) = -bl{l - Fi{bn) + F,{-bn)) + 2 x{l- Fi{x) + Fi(-x)) dx 
■b„ Jo 

< 2 / x{l - Fi{x) + Fi{-x)) dx 
Jo 

and similarly 

\x\dF,{x) = -6„(l-Fi(6„) + Fi(-6„))+ / (l - F,{x) + F,{-x)) dx 

-bn Jo 

rbn 

< / {1 - Fi{x) + Fi{-x)) dx. 
Jo 

Since 1 — (5n — > 1, we get 

KWI ^ ^ f\{l-F,{x) + F,{-x))dx 

+ ^ (^j\l-F,{x) + F,{-x))dxy (7.3) 
with some constant C depending on p. 
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Recall that in the asymptotical formulas (1.3)-(1.4) for Fi, the function B is equiva- 
lent to the slowly varying function h associated with the characteristic function of Xi. 
Thus, with some Cq > 0, Ci > (cq + Ci > 0), we have 

Hence, up to a constant, the first integral in (7.3) does not exceed 

Jo x^-' " ^ "Vo MM 

But, by the well-known result on slowly varying functions ([Se], pp. 66-67), 



^ hisbn) ds ds 1 

r = , as n — >■ oo. 



/o h{bn) s"-i Jo s"-^ 2 -a' 
Therefore, with some constants Ci, C2, 

n 



/ x(l - F,(x) + Fi(-x)) dx < Cib-'^hibn) < 
Jo 



K JO 

where we have applied equation (5.9) of Proposition 5.3, telling us that /i(6„) ~ b'^/n. 

Now, consider the second integral in (7.3). In case a < 1, again by [Se], applied to 
the value a + 1, 

^ hisbn) ds ds 1 

> — = , as n — > oo. 



/o h{bn) Jo s°' I- a 

Hence, using the asymptotic for Fi, the second integral in (7.3) does not exceed, up to 



a constant, 

'0 ^ Jo 

As a result. 



/ 

^0 



h{bn) s° {l-a)n' 

1 / /-bn \^ C 

I [Jo (i-^i(^) + ^i(-^))^^J 



K 

with some constant C, depending on p and a. 

The case 1 < a < 2 is simpler, since then the above integral is convergent on 
the positive half-axis, while the factor ^ behaves like (up to a slowly growing 

sequence), so it decays faster than 1/n. 

Finally, in case a = 1, using the bound h{x) < CsX^, a; > 1 (where e > is any 
prescribed number), we see that, for large n the second integral in (7.3) does not exceed, 
up to a constant. 



1 + M rf^ < cbl/\ 

Ji X - " 
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This yields 

with some constant C depending on the density p. But the ratio behaves hke n~^^^ 
up to a slowly growing sequence, so it decays faster than ^, as well. Thus, in all cases 
- Fi{x) + Fi{-x)) dxf = 0(i). Lemma 7.1 is proved. 

Lemma 7.2. Let 5 e (0, a) and r] G (0, 1) be fixed. There exist positive constants e, 
c, C , depending on p, 5, rj, with the following property: If k > rjn, then 

\Mt)\' = l/nWI' < Ce-'^l*^ for \t\ < sbn. (7.4) 



Proof. This is an analogue of the bound (5.10) for the characteristic functions of 
Zn- In order to prove this upper bound, assume \t\ > 1 and note that 

•^-(^) = rr^ (fiii/bn) SrMt)), t e R. (7.5) 

1 

To proceed, we apply Proposition 5.3. First recall that, according to Karamata's 
theorem, any positive slowly varying function h{x) defined in a; > has a representation 

h[x) — c[x) exp I j — — dy j, x > xq, 

where Xo > 0, c{x) — )■ c 7^ and w{x) — ?> 0, as a; — > +00. For xq — min„>i 6„, 
1 < 1^1 < £bn, where < £ < 1 is fixed, this representation implies 

hibj\t\) 
h{b„ 

Hence, from (5.8)-(5.9) 

\fi{t/bn)\ = eM-c\trb-''h{bn/m < eM-c'\tryn} 

with some constant c' > 0. 

We choose e > to be small enough so that < a — S. Now, applying the above 
estimate in (7.5), we get in the region 1 < |t| < ebn 

\fn{t)\ < -^mt/bn)\+Sn) 

< -^{exp{-c'\tryn} + 6n). 

i--(>n 

One can simplify the right-hand side by noting that ^-^-L — < — < K with some 
constant K. Using logx < x — 1 (x > 0) and < 1 — (1 — e~^) x for < x < we 



> \t\ with 7 = 7(5)= sup |u'(y)|. 

y>\/e 
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then have 

log (exp{-c'|t|"-^/n} + 6n) < exp{-c'|t|"-^/n} + 5„ - 1 

„/U|a-7 „ rnly-|"~T 

< -(l-e-^)^L + 5^ < 

^ ^ n n n 

with positive constants Cj. As a result, 

\fn{t)\ < exp{i(ci-C2|tr-^)} 

with some other positive constants ci and C2 (independent of n). It remains to raise this 
inequality to the power k, and (7.4) follows. 

Wc will now develop a few applications of Lemmas 7.1-7.2 using the assumption 
Hp) < +00. The latter forces p to have bounded variation and vanish at infinity. 
Hence, 

IknllxV = llPnllxV = &n(l " ^n) 1 {|x| <b„} || TV < &n(l " 5„)"^a/7(p). (7.6) 

Using the inequality (3.2), we see that the characteristic functions of p„ and of the 
centered density rn{x) = pn{x + dn) satisfy 

\Mt)\^\fn{t)\<^ {t^O) (7.7) 

with some constant c — c{p), depending on p, only. 

Corollary 7.3. If I{p) < +oo, then under the assumptions of Lemma 7.2 with k> A, 
we have with some constant C depending on p, 5, rj, only, 



—oo 

+ 00 



{l + \t\)\i:'^{t)\dt < C, (7.8) 

't'\M'{t)\'dt < a (7.9) 

Proof. We have M'(t) = ki/j'^it) ijjn(t)''-\ while by (7.2), 



/_: 



t' Km' \Mt)\'^'-'^ dt<^ I t' \Mt)\'^'-'^ dt. 



oo 



To estimate the last integral, first we use (7.4) which gives 

t^\Mt)\''^'~'^dt<C. 



I 

J\t\ 



\<ebn 

For the complementary region \t\> ebn, note that 



1 f^" 

fnipnt)^- ^/ e'^''p{x)dx, 
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which shows that these functions are separated from 1 uniformly in n in \t\ > e. (This 
can easily be seen by using general separation bounds for characteristic functions which 
are discussed in [B-C-G2]). Thus, 

sup |^n(M)| = sup \fn{hnt)\ < C'", 

|t|>£ |t|>£ 

for some constant c > independent of n. In addition, by (7.7), 

t'\Mbnt)\'<^ 

with some other constant. Hence, 

J\t\>ebn J\t\>e 

The last expression is exponentially small with respect to n by the constraint on /c, and 
we arrive at (7.9). The first inequality (7.8), which is simpler, is proved similarly. 

8. Boundedness of Fisher Information. Proof of Theorem 1.1 

In this section we complete the last step in the proof of Theorem 1.1. Keeping the 
same notations as in the previous sections and recalling Lemma 6.1, we only need: 

Lemma 8.1. Assume that Zn =^ Z weakly in distribution, where Z has a non- 
extremal stable law. If I{Zno) < +oo, for some no, then sup„>„(, I{Zn) < +oo. 

In the normal case, when Xi has a finite second moment, the assertion immediately 
follows from Stam's inequality (2.3). In view of Lemma 6.1, we therefore obtain Barron- 
Johnson theorem, i.e., I{Zn\\Z) — > 0. Thus, we may focus on the case < a < 2. 

To simplify the argument and the notations, we assume no = 1 (otherwise, mild 
modifications connected with the binomial decomposition are needed). Thus, let I{p) < 
+00, where p is the density of Xi. As in the previous section, we denote by p„ the 
density of Z„ = Z„ + a„ and assume that Zn ^ Z weakly in distribution, where Z has 
a non-extremal stable law. 

By Stam's inequality (2.3), 

I(Zn)<^^I(p). 

Th 

Although the right-hand side tends to infinity, as n ^ 00, this inequality may be used 
for small values of n, and here it will be sufficient to show that sup„>„^^ I{Zn) < +00, 
for some no- 
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Our basic tool is the binomial decomposition (7.1) of the previous section. Note that, 
by the convexity of the /-functional, 

^^P-) ^ E it) (1 - ^-f^n-'liPn * e-'^*), (8.1) 

and it suffices to properly estimate the terms in this sum. To this aim, we fix a number 
77 e (0, 1) and distinguish two cases. 

Lemma 8.2. If k < n — 3, then 

< Cinbnflip) (8.2) 

with some constant C depending on p, only. 

Proof. By the monotonicity property (2.2), /(p^* * q^~''^*) < I{qlC~''^*)- On the 
other hand, by Proposition 3.2, ii n — k > 3, 

T(n{n-k)*\ < 1 |l^[(n-fc)/3]*||2 , 9 \\n^i^-^)/^]*\W^, . 1 1 «"-*^-2[("-fe)/3]* 11 , ^ 
^\^n ) ^ 2 V" " llTV + ^l|yn l|TV N^n llTV^- 

But the total variation norm decreases when taking convolutions, so that ||^^*||tv ^ 
ll^nllxv (s = 1,2, . . .). Hence, 



In turn, by means of the inequality ||p||tv < \/ I{p) (Proposition 3.1), we have 
where we used the property p(— oo) = p(-|-oo) = for the first inequality. Thus 

Recalling that 5^ ~ ^, Lemma 8.2 is proved. 
Lemma 8.3. If 15 < rju < k < n, then 

l{Pn*Q^n-'^*) < C (8.3) 

with some constant C depending on p and rj, only. 

Proof. Again appealing to the monotonicity of the Fisher information, we will use 
the bound 

Thus, involving the centered density r„(a;) = Pn{x + dn) with the characteristic function 
ipn (a-s in the previous section), it suffices to show that 

Hr'n) = HPn) < C. (8.4) 
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Assume first that rjon < k < n, where < tjo < fj- Since ||r„||Tv < Cbn^/I{p) < +00 
(see (7.6) and Proposition 3.2), the convolution powers r^* have finite Fisher information, 
whenever k > 3. In view of the bound (7.7) on the characteristic functions, we may 
invoke inversion formulas like in (5.6)-(5.7) to write, for any x e R, 

1 



= — I e'^'^MtTdt, (8.5) 



00 
00 



1 r°° 

ir'n)'i^) - - j j''%-it)Utf dt, (8.6) 
1 1"^ 

rl*{x)+x{rl*)\x) - -^j e-''^tkMt)'-'^n{t)dt, (8.7) 

where for reasons of integrability it is safer to assume that A; > 5. 

Corollary 7.3 tells us that the Fourier transforms in (8.5) and (8.7) are well-defined 
for square integrable functions whose L^-norms are bounded by a constant independent 
of k and n. Hence, the same is true for 

X (r^*)'(a:) = -- / e"^*^ {Mt)' + t^V'n(t)'- VU^)) dt, 



—00 



and we may write 



with 



l(C)'(^)l < 



\X\ 



/+00 
Unk{x)^dt<C. (8.9) 
-00 

Moreover, according to (7.8), L^-norms of the functions {—it)ipn{t)'' in (8.6) are also 
bounded by a constant independent of k and n. Hence, 

sup|(r^7(a:)|<C 

X 

for all n and rjon < k < n. As a result, (8.8) may be sharpened to 

\ir'nn^)\< 



1 + 

with some functions Unk satisfying (8.9). By applying Cauchy's inequality, the latter 
immediately implies that 



Fn IItV 



/+00 
\{r'^*yix)\dx<C'\\unk\\2 < C, (8.10) 
■00 



where the resulting constant C may depend on p and rjo (by choosing, for example, 
5 = q;/2 in the previous auxiliary lemmas of the previous section). 
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We now apply Proposition 3.2 to convolutions of any three densities r^*, as above. 
That is, if rjon < kj < n and kj > 5 {j — 1, 2, 3), we obtain by (3.4) and (8.10) that 

Starting with k > 15, put ki = k2 = [|], = k — {ki + k2), so that kj > 5. Also, if 
k > r/n, we have kj > [^] > Hence, we may choose rjo — ^, and thus (8.11) imphes 
(8.3)-(8.4). 

Proof of Lemma 8.1. In the case 15 < ?7n < n — 3, we may combine Lemmas 8.2 
and 8.3 to get from (8.1) with some constant C — C{p, rj) 

0<fc<r/n ^ r]n<k<n ^ ^ 

< C(n6„)2-2"5ii-'')" + C < C, 

where the last inequality holds for all sufficiently large n (by using Sn ~ fi) with, for 
example, rj — ^. Lemma 8.1 and therefore Theorem 1.1 are now proved. 

Remark 8.4. Finally, let us comment on the conditions a)—h) from the Introduction. 
In view of the general bound (3.3), a) is always necessary for the finiteness of I{Zn) with 
some n. Since h) is weaker than a), wc need explain the opposite direction. 

If 1 < a < 2, then Xi has finite first absolute moment C = E \Xi\. Hence, under 
(1.6), the condition (3.5) is fulfilled and thus the bound (3.6) is applicable to all Z„ with 
n>{v-\- 2)/2. More precisely, denoting by gn{t) — fi{tY characteristic function of 
Sn — Xi-\ h we have 

\{t9n)'{t)\ < \gn(t)\ + \t\Mt)\ < {l + Cn\t\)\mr-\ 
thus Sn has a density pn{x) whose total variation norm satisfies 

/ + 0O /■+00 
t'lmi'^'dt / (1 + cn\t\r \mf^--'Ut < +00. 
-00 J —00 

By Proposition 3.2, we get /(5'3„) < +00. 
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